Nonnegative Principal Component Analysis for Proteomic Tumor Profiles
نویسنده
چکیده
Identifying cancer molecular patterns with high accuracy from high-dimensional proteomic pro les presents a challenge for statistical learning and oncology research. In this study, we develop a nonnegative principal component analysis and propose a nonnegative principal component analysis based support vector machine with a sparse coding to conduct e ective feature selection and high-performance proteomic pattern classi cation. We demonstrate the superiority of our algorithm by comparing it with six peer algorithms on four benchmark proteomic tumor pro les under 100 trials of 50% holdout cross validations. We also rigorously show that the overtting problem associated with support vector machines can be overcome by nonnegative principal component analysis with exceptional sensitivities and speci cities. Moreover, we illustrate that nonnegative principal component analysis can be employed to capture meaningful biomarkers.
منابع مشابه
Study of Physical and Chemical Soil Properties Variations Using Principal Component Analysis Method in the Forest, North of Iran
The field study was conducted in one district of Educational-Experimental forest at Tehran University (Kheirood-Kenar forest) in the North of Iran. Eighty-five soil profiles were dug in the site of study and several chemical and physical soil properties were considered. These factors included: soil pH, soil texture, bulk density, organic carbon, total nitrogen, extractable phosphorus and depth ...
متن کاملProteomic signatures for histological types of lung cancer.
We performed proteomic studies on lung cancer cells to elucidate the mechanisms that determine histological phenotype. Thirty lung cancer cell lines with three different histological backgrounds (squamous cell carcinoma, small cell lung carcinoma and adenocarcinoma) were subjected to two-dimensional difference gel electrophoresis (2-D DIGE) and grouped by multivariate analyses on the basis of t...
متن کاملDocument clustering using nonnegative matrix factorization q
A methodology for automatically identifying and clustering semantic features or topics in a heterogeneous text collection is presented. Textual data is encoded using a low rank nonnegative matrix factorization algorithm to retain natural data nonnegativity, thereby eliminating the need to use subtractive basis vector and encoding calculations present in other techniques such as principal compon...
متن کاملMethodology and Theory for Nonnegative-score Principal Component Analysis
We develop nonparametric methods, and theory, for analysing data on a random p-vector Y represented as a linear form in a p-vector X, say Y = AX, where the components of X are nonnegative and uncorrelated. Problems of this nature are motivated by a wide range of applications in which physical considerations deny the possibility that X can have negative components. Our approach to inference is f...
متن کاملDifferential Lipid Profiles of Normal Human Brain Matter and Gliomas by Positive and Negative Mode Desorption Electrospray Ionization – Mass Spectrometry Imaging
Desorption electrospray ionization-mass spectrometry (DESI-MS) imaging was used to analyze unmodified human brain tissue sections from 39 subjects sequentially in the positive and negative ionization modes. Acquisition of both MS polarities allowed more complete analysis of the human brain tumor lipidome as some phospholipids ionize preferentially in the positive and others in the negative ion ...
متن کامل